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ABSTRACT 

We evaluate the accuracy of semi-analytic merger-trees by comparing them with the 
merging histories of dark-matter halos in TV-body simulations, focusing on the joint 
distribution of the number of progenitors and their masses. We first confirm that the 
halo mass function as predicted directly by the Press-Schechter (PS) model deviates 
from the simulations by up to 50% depending on the mass scale and redshift, while 
the means of the projected distributions of progenitor number and mass for a halo of a 
given mass are more accurately predicted by the Extended PS model. We then use the 
full merger trees to study the joint distribution as a function of redshift and parent-halo 
mass. We find that while the deviation of the mean quantities due to the inaccuracy of 
the Extended PS model partly propagates into the higher moments of the distribution, 
the merger-tree procedure does not introduce a significant additional source of error. 
In particular, certain properties of the merging history such as the mass ratio of the 
progenitors and the total accretion rate are reproduced quite accurately for galaxy 
sized halos (~ 10 12 M Q ), and less so for larger masses. We conclude that although there 
could be ~ 50% deviations in the absolute numbers and masses of progenitors and in 
the higher order moment of these distributions, the relative properties of progenitors 
for a given halo are reproduced fairly well by the merger trees. They can thus provide a 
useful framework for modelling galaxy formation once the above-mentioned limitations 
are taken into account. 
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1 INTRODUCTION 



F or example, the model developed by Press fc Schechtei 
(1974| ) provides a simple but relatively effective framework 



In the standard picture of hierarchical structure formation, 
small objects form first and then merge together to form 
larger objects as time progresses. Hierarchical structure for- 
mation is a generic prediction of the Cold Dark Matter 
(CDM) family of models, being a natural consequence of the 
general shape of the power spectrum. The process of struc- 
ture foiimalion can be lepieseiilud by Lire "merging lrisloiy' 1 



of ■ 

identifi ed at an earlier time when they were 3till distinct cn - 

tities, that will later merge together to form a larger halo. 
This merging history is often referred to as a "merger tree" . 
This whole conceptual framework is somewhat artificial, re- 
lying as it does on a definition of "halo", which involves 
assumptions of sphericity, virialization, etc. These proper- 
ties almost certainly do not apply to all of the objects in 
the rea 
reason 



for the description of the mass history of particles in a 
hierarchical universe with Gaussian initial perturbations. 
The original Press-Schechter (PS) model predicted the mass 
function of halos as a function of redshift, i.e. the number 
density of halos of a given mass at a redshift z. This predic- 
tion has been tested against th e results of iV-body simula- 
tions in several previous papers (pfstathiou et al. 1988|: jGcllJ 
Bcrtscliinger 1994 lLaccy fc Cole 1994 Ma 19961 ; [TozziT 



Governato 1997] ; |Gross et al. 1998| ; [Tormen 1998; ). The Press- 
Schechter theory was extended to give the conditional prob- 
ability that a particle in a halo of mass Mo at zq was in a halo 
of mass Mi at an earlier redshift z\, leadi ng to an expres- 
sion for the c onditional halo mass function (Bond et al. 1991; 



uuiveise Lhal we might like Lo sLudy. Ilowevei, one 
for ita popularity io that it onabloo one to develop 



Bower 199l[ ). This extended Press-Schechter (EPS) formal- 
ism can also be manipulated to obtain expressions for halo 
survival ti mes, formation times, and merger rates (Lacey & 
Cole 1993| , hereafter LC93). The extended Press-Schechter 



analytic and semi-analytic predictions for many quantities 
of interest. 



model and its implications have n ot been quite as well stud- 
ied as the standard PS model, but Lacey & Cole (1994) and 
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recently Tormen (1998) have presented comparisons of the 



EPS model with TV-body simulations. 

However, the EPS formalism only provides us with 
mean quantities. In order to construct realizations of in- 
dividual dark matter halo merging histories, one must in- 
clude some additional assumptions. Although several meth- 



ods for constructing merger trees have been proposed ( Cole 



Kais> ;r 1988[ [Kauffmann fc White 199$ |Somerville & Ko- 
latt 19£gJ7 all of them involve some ad hoc ingredients. Such 
a method should be designed to reproduce both the condi- 
tional mass function predicted by the EPS model and to 
enforce conservation of mass at every stage. However, these 
constraints are not necessarily consistent with one another. 
This leads to certain ambiguities and subtleties in the pro- 
cess of constructing the merging trees, which we will discuss 
later in this paper. For a fuller discuss ion of these issues, 
however, see |5omcrville fc Kolatt (1998) , hereafter SK). 



These merger trees form the framework for semi 
ana lytic models of galaxy formation such as th o se described 



by Kauffmann, White. 



(19940and Bomerville 



fc Guiderdoni (1993|) , |Cole et al 
5 Primack (1998|). The properties of 



the galaxies created in these models presumably depend to 
some degree on the merging history of the dark matter halos. 
Most of the previous work using the semi-analytic models 
has focussed on reproducing or predicting mean quantities 
and qualitative trends. However, it would be useful to inves- 
tigate whether the broader properties of the predicted dis- 
tribution of model galaxies are consistent with observations. 
For example, we might wish to investigate the scatter in well 
known observational relations like the luminosity-linewidth 
relation, the color-magnitude relation, or fundamental plane 
relations. But before we can trust the models for evaluating 
these kinds of questions, we must make sure that the merger 
trees not only satisfy the mean properties readily predicted 
by the EPS model, but ideally also the full distribution func- 
tion. This has not been thoroughly investigated in previous 
work on this subject, and is the primary goal of this paper. 
The PS and EPS formalism do not provide any information 
about this distribution. Therefore we are forced to appeal 
to iV-body simulations. 

In this paper, we present a comparison between iV-body 
simulations and semi-analytic merger trees (SAM-trees), as 
well as the analytic predictions of the EPS theory when 
they apply. We address several issues. First, we re-examine 
the agreement of the direct predictions of the standard and 
extended Press-Schechter model with the simulations. Al- 
though much of the previous work on this topic has stressed 
the unexpectedly good agreement, we focus on the discrep- 
ancies, their broader implications, possible causes and po- 
tential resolutions. We then study the success of the merger- 
tree method in the reconstruction of the distribution of pro- 
genitor number and mass compared to the results of the 
simulations, and examine several statistics relevant to the 
study of galaxy formation in a hierarchical context. 

The outline of the paper is as follows. In we give 
a brief summary of the PS and EPS formalism. In we 
summarize the merger-tree method. In sQ, we describe the 
iV-body simulations. In ^ we present the comparison be- 
tween the simulations and the SAM-trees. We discuss our 
results and conclude in Spl 



2 THE PRESS-SCHECHTER FORMALISM 

Suppose that we have smoothed the initial density distri- 
bution on a scale R using some spherically symmetric win- 
dow function Wm{t), where M(R) is the average mass con- 
tained within the window function. There are various possi- 



ble choices fo r the form of the window function (c.f. Lacey 
k Cole 1993| ). We use a real-space top-hat window func- 
tion, W M (r) = e(7?-r)(47r7? 3 /3) _1 , where 6 is the Heavi- 
side step function. In this case M = 4-KpoR 3 /3, where po is 
the mean mass density of the universe. The mass variance 
S(M) ee <r 2 (M) may be calculated from 



a(M) 



1 

2^ 



P{k)W 2 {kR)k 2 dk, 



(1) 



where P(k) is the power spectrum of the matter density 
fluctuation, and W(kR) is the Fourier transform of the real 

space top-hat. 

The "excursion set" derivation due to Bond et al. (1991) 
leads naturally to the extended Press-Schechter formalism. 
The smoothed field S(M) is a Gaussian random variable with 
zero mean and variance S. The value of 5 executes a ran- 
dom walk as the smoothing scale is changed. Adopting an 
ansatz similar to that of the original Press-Schechter model, 
we associate the fraction of matter in collapsed objects in 
the mass interval M, M + AM at time t with the fraction 
of trajectories that make their first upcrossing through the 
threshold u; = 5 c (t) in the interval S, S + dS. This may 
be translated to a mass interval through equation (|l|). The 
threshold S c (t) corresponds to the critical density at which 
a pertubation will separate from the background expansion, 
turn around, and collapse. It is extrapolated using linear the- 
ory, 5 c (t) = S c ,o/Du n (z ), where Dim (2) is the linear growth 
function (Peebles 198C). The halo mass function (here in the 
notation of LC93) is then: 



f(S,<j)dS: 



exp 



CO 

~2S 



dS. 



(2) 



The conditional mass function, the fraction of the tra- 
jectories in halos with mass Mi at z\ that are in halos with 
mass Mo at zo (Mi < Mo, zo < zi) is 



1 (u>i 



2^ (Si - s Q )y- 



■ exp 



(ui — ujq) 2 
"2(5i -So) 



dSi. (3) 



The probability that a halo of mass Mo at redshift zo had 
a progenitor in the mass range (Mi, Mi + dMi) is given by 
(LC93): 

dP 

-(Mi, 21 I M ,« )dMi = 



dMi 



^/(Si^i I So,u ) 



dS 



dM 



dMi , (4) 



where the factor Mo /Mi converts the counting from mass 
weighting to number weighting. 

We can now derive two more quantities that will be 
useful later. Given the mass of a parent halo Mo and the 
redshift step zo — > z\, the average number of progenitors 
with mass larger than Mi is: 

N=(N P (M\M ))= j ° dM^-^(M, Zl \Mo,zo). (5) 
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We can also readily calculate the average fraction of Mo that 
dwelt in the form of progenitor halos of mass M > Mf. 



f P = 



dP 

dM—(M, Zl \M ,zo) 



(6) 



and define the complimentary quantity for the average frac- 
tion of Mo that came from "accreted" mass, / acc = 1 — f p . 

Throughout this paper we use the standard value for the 
collapse threshold, <5 Cj o = 1.69, corresponding to the value 
predicted by the top-hat spherical collapse model. 



3 THE TREES 

We first should clarify some of our terminology. Suppose we 
have identified a halo of a given mass at some redshift, whose 
merging history we wish to know. This halo is referred to 
as the "parent". We then go backwards in time and trace 
all the smaller halos that merged together in order to form 
this halo. These are referred to as "progenitors". Although 
it violates the usual temporal relation of kinship, because 
we construct the tree working backwards in time, this has 
become the usual terminology. 

Let us suppose that we set out to construct a merger 
tree, taking the EPS formalism as an ansatz. Such a merger 
tree should satisfy at least two basic requirements: each indi- 
vidual halo history should conserve mass at every stage, and 
the EPS and PS mass functions (for every redshift) should 
be reconstructed in the mean for a large ensemble of realiza- 
tions. Unfortunately, satisfying both requirements exactly 
and simultaneously is not straightforward, and is sometimes 
impossible for certain choices of power spectrum and red- 
shift interval. There are also additional problems - the EPS 
formalism gives us the single halo probability that a halo 
with mass Mo at zo has a progenitor of mass Mi at z\ — it 
does not provide us with any information about the rest of 
the progenitors that make up the mass of Mo (i.e. the joint 
progenitor probability function). There are certain obvious 
constraints which the EPS formalism knows nothing about, 
for example, the sum of the progenitor masses clearly cannot 
exceed Mo. In addition, since the number of halos diverges 
at very small mass, one must impose some minimum mass 
Mi, below which one no longer follows the history of these 
halos. However, one must account for the mass contributed 
by these small halos. We will refer to halos with masses 
larger than M% as "progenitors", and the mass contributed 
by halos smaller than this limit as "accreted mass". 

One must therefore form a compromise between mass 
conservation and reconstructing the mean as predicted by 
the EPS model, and one must make some ad-hoc assump- 
tions in or der to fill in the gaps ment ioned above. In the 



method of Kauffmann & White (1993), the mean progeni- 



tor mass distribution is reproduced exactly, and mass con- 
servation is enforced only a pproximately. In the scheme of 
Somerville & Kolatt (1995), mass conservation is enforced 



exactly and the mean progenitor distribution is reproduced 
approximately. 

We now sketch the method of SK. Given a parent halo 
with mass Mo, the histories of individual particles, or tra- 
jectories, in the language of the excursion set formalism, are 
followed back in time using very small timesteps. For each 
particle that finds itself in Mo at Zq, we choose the mass 



of the progenitor halo it was located in at an earlier red- 
shift zi from Eqn. |j| A "branch" occurs when at least two 
trajectories are found in halos with masses greater than the 
minimum halo mass Mi . The timestep has been chosen such 
that "branching" events are improbable; i.e. in most cases 
one trajectory will be in a halo with mass close to Mo and 
others will be in halos with masses less than Mi . This ensures 
that most of the branching events involve small numbers of 
branches, but no limit is imposed on the number of branches 
allowed. In order to avoid choosing a progenitor with a mass 
larger than the remaining unallocated mass, the probabil- 
ity function is truncated for masses larger than this value. 
The contribution from "accreted mass" is included explic- 
itly from trajectories that find themselves in halos smaller 
than Mi. It is possible to include these without introducing 
a large computational overhead because the mass weighted 
probability, used for the trajectories, does not diverge at 
small mass. One continues to follow trajectories until all of 
the mass of the parent has been accounted for. Therefore 
mass is conserved exactly. However, the shape of the condi- 
tional mass function that is obtained, although very similar 
to the EPS prediction we wish to reconstruct, is not exact 
(see SK). 



4 THE SIMULATIONS 

In order to study the formation histories of halos in numer- 
ical simulations, we need: 

(i) High mass and force resolution 

(ii) A large enough volume to contain many halos in the 
mass range of interest 

(iii) Stored data for several time steps, at appropriate red- 
shifts. 

We use the adaptive particle-part icle /particle-mesh si mula- 



tions of the VIRGO consortium (Jenkins et al. 1997). The 
resolution and box size of the smaller volume simulations in 
the suite are appropriate for our purposes, since we wish to 
focus on galaxy or group sized halos. 

We present our results for the rCDM model, a criti- 
cal (il = 1) model with shape parameter P = 0.21. This 



galaxy clustering ( 


Maddox et al. 1990|; 


Maddox, Efstathiou, 


& Sutherland 1996 


; Vogeley et al._1992 


). The parameters of 



same analysis for a simulation with flo = 0.3 and an open 
geometry, and we find qualitatively similar results to those 
we will describe for the rCDM model, leading to similar con- 
clusions. We therefore do not present the results from other 
cosmological models. 

The halos are identified using a standard "friends-of- 
friends" algorithm with a linking length 6 = 0.2d, where d is 
the mean interparticle spacing d — Lbox/256. The friends- 
of-friends algorithm with this value for the linking length has 
been shown to give roughly equivalent results to spherical 



over-density based halo finders ( Lacey & Cole 1994 ) . 

The procedure to identify progenitors and assign their 
masses is as follows. Consider a parent halo with mass Mo 
at z = 0. We track each particle in this halo to an earlier 
redshift, Zi, and find the mass of the halo in which it is lo- 
cated at that time. We then identify all the other particles 
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Table 1. Simulation Parameters. From left to right, the quantities given are the mass density in units of the critical density, the Hubble 
parameter h = i/o/(100km s _1 Mpc -1 ), the linear rms mass variance on a scale of 8h~ 1 Mpc, the number of particles, the number of 
grid cells, the length of one side of the box, and the mass per particle. 



Model n 


h as 


N p 


N g L b ox (Mpc) 


m p (M ) 


tCDM 1.0 


0.5 0.6 


256 3 


512 170 


2.0 x 10 10 



in the progenitor halo that are also present in the parent 
halo Mo at z — 0. At this point, we may be faced with sev- 
eral problems which reflect the inconsistency of the simple 
merging hierarchy picture with the numerical properties of 
iV-body simulations. We may encounter the unphysical sit- 
uation where a progenitor halo identified in this way has a 
mass larger than the parent. This is simply because indi- 
vidual particles are often stripped or ejected from halos in 
the simulations. If this occurs, we exclude the parent halo 
from our sample. In addition, not all the particles in the pro- 
genitor end up in the same parent halo. We then have two 
possible choices of ways to assign mass to the progenitor. We 
can either take only the mass of all the particles in the pro- 
genitor that also end up in the parent halo, or we can take 
the total mass of the progenitor halo. In the first case, the 
mass does not reflect the dynamical mass of the progenitor, 
which is the quantity that is treated by the PS formalism. 
In the second case, conservation of mass is violated. Since 
it is not clear which is the most consistent prescription, and 
since these two choices in some sense represent opposite ex- 
tremes, we have performed our analysis using both schemes. 
We shall refer to the masses identified in these two ways as 
the "inclusive" (mass of particles that end up in parent) and 
"total" (total virial mass of progenitor) progenitor masses. 

Recall that in the SAM-tree procedure, we defined a 
minimum progenitor mass Mi. In the case of the trees, this 
mass scale is arbitrary, but in the simulations, we should 
choose it to correspond to the smallest halo that we believe 
to be numerically stable. We have tried using a minimal halo 
mass corresponding to both 10 and 25 particles. We show 
the results for the choice of 25 particles because the agree- 
ment between Press-Schechter and the simulations is better, 
and because the results for the two different definitions of 
progenitor mass discussed above are more consistent with 
one another. This leads us to believe that the results are 
less affected by the numerical instability of the halos if we 
only consider halos of more than 25 particles. As in the trees, 
we now term the total amount of mass contributed by halos 
with masses smaller than Mi as the "accreted mass". The 
estimate of the accreted mass also depends on the method 
used to assign progenitor masses. For the "inclusive" scheme, 
the mass of the halo is more likely to fall below M; and hence 
the estimate of accreted mass tends to be larger. 



5 THE COMPARISON 
5.1 Mean Quantities 

In this section, we compare the results of the simulations 
described in the last section with the predictions of the 
PS and EPS models^ and with the results of the SAM- 
trees described in sM In the remainder of the paper, we 



log M [M ] 



10 12 14 16 




log(M/M,) 

Figure 1. The mass function of halos predicted by the standard 
Press-Schechter model (dashed lines), and found in the simula- 
tions (solid lines) at redshifts of (from top to bottom) z = 0, 0.2, 
0.5, 1.1, 2.1, and 3.0. 



will express all masses in terms of the minimum progeni- 
tor mass, Mi, which here corresponds to a physical mass of 
25 m p = 5.0 x 1O 11 M0. In order to have sufficient statistics, 
we group the parent halos in the simulations into ten loga- 
rithmically spaced mass bins, spanning the mass range from 
5 Mi to 500 Mi . We generate the S AM-trees for a grid of par- 
ent masses covering the range of Mo in each mass bin used 
for the simulations, weighting the contribution according to 
the Press-Schechter number density at z — 0. This is an at- 
tempt to mimic the range of parent masses represented in a 
given bin in the simulations, and accordingly any scatter in 
the distributions that may result from this effect. 

Figure |l| shows the overall halo mass function for the 
simulations, compared with the standard Press-Schechter 
model. At z = 0, the PS model agrees well with the simula- 
tions at high masses (;> 1O 14 M0), and overpredicts the num- 
ber of halos by about a factor of two between masses of 10 14 
and ~5x 10 n A/©. At very small masses (< 2 x 10 n M Q ), 
the simulations have many more halos than the PS model 
prediction, which is almost certainly due to numerical ef- 
fects. 

The scatter in the simulation mass function due to shot 
noise is insignificant on the mass scales where the discrep- 
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Figure 2. The conditional mass function of halos predicted by 
the Extended Press-Schechter model (solid lines), found in the 
simulations (filled squares), and in the SAM-trees (dashed lines). 
The Extended Press-Schechter prediction and the SAM-trees have 
been averaged over the bin in parent mass used for the simula- 
tions, indicated by the vertical lines on the bottom of the box. 
The panels show redshifts 0.2, 0.5, 1.1, and 2.1 as indicated on 
the figure. The mean parent mass is Mq = 10M; . 



ancy with the Press-Schechter model has been noted (at 



z = 0, less than 5 percent for M <J 10 Mq, less than 10 per- 
cent for M ~ 10 14 M Q ). If plotted as error bars on Fig. |l], the 
width of the error bars on these scales would be about the 
same as the width of the line used to plot the mass function. 
The shot noise becomes significant (greater than 20 percent) 
only for the two largest mass bins (M > 5 x 10 14 ), which 
contain only a few halos because of the limited volume of 
our simulation box. 

The mass scale at which the PS model starts to under- 
estimate the number of halos moves to smaller values with 
increasing redshift. This means that by a redshift of z ~ 3, 
the simulations have considerably more halos than the PS 
prediction at masses > 10 12 Mq; i.e. the evolution of the 
mass function with redshift is exaggerated in the PS model. 
However, the mass scale of the steepening on the small mass 
end remains constant with redshift, as expected if it is in- 
deed due to numerical effects. 



It should be noted that Gross et al. (1998) and Gross 
(1997^~|find the similar results (both the nature and mag- 
nitude of the discrepancy and its dependence on redshift) 
using a different kind of code (Particle-Mesh) and a dif- 
ferent method for identifying halos (spherical and ellipti- 
cal overdensity methods). The same results are obtai ned in 
all five of th e CDM-variant cosmologies simulated by Gross 
et al. (1998). Si milar results have also been obtained by Tor- 
men (1998j) and Tozzi & Governato (1997). It should also be 



noted that changing the value of the parameter 5 c ,o will only 
cause the PS mass function to cross the simulation mass 
function at a different mass scale and cannot bring the mass 
functions into good agreement simultaneously on all scales. 



Figure ^| shows the conditional mass function for a bin in 
parent halo mass Mo = 7.9—12.6 Mf, i.e. the probability that 
the parent halo had a progenitor of mass Mi at some ear- 
lier redshift Zj . The geometric mean parent mass is approxi- 
mately 10 Mi , which with our choice of Mi = 5.0 x 10 11 , cor- 
responds to the mass of the halo surrounding an L, galaxy 
(M = 5.0 x 10 12 Af©). In addition to being a relevant mass 
scale if we are interested in studying galaxy formation, there 
are many such halos in our simulation volume, so that shot 
noise is insignificant for the remainder of the comparisons 
that we shall discuss. Figure ^| shows the prediction of the 
EPS theory (Eqn. the SAM-trees, and the results of the 
simulations. The EPS prediction and the SAM-trees have 
been averaged over the bin in parent halo mass used for 
the simulations. We find that the simulation results are rea- 
sonably similar for the two progenitor mass identification 
schemes described in Section |i|, so we simply average the 
values obtained in the two schemes for the remainder of the 
paper. 

Note that the discrepancy between the EPS theory and 
the simulations is qualitatively similar to the discrepancy 
that we saw before between the standard PS and the total 
mass function. The simulations have fewer halos than EPS in 
the intermediate mass range, and (though this is not shown 
in the figure) steepen at very small masses (< Mi). We also 
see the same reversal in the trend as a function of redshift. 
The peak near Mo is moved to somewhat smaller masses. 
This means that the largest progenitor in the simulations 
is somewhat smaller than in the the EPS-based trees, and 
the complementary accreted mass is larger. Similar effects 



are seen for cluster-sized halos in the simulations of Tormen 



(1995 



5.2 Distributions 

We now address the distribution of progenitor mass and 
number, i.e. the probability that a halo of mass Aio at zq 
had a progenitor at Z\ with mass M p , and that this pro- 
genitor was a member of a set of n progenitors. We denote 
this as P(n, M p ). We show this distribution in figures ^-^[ 
The number of points plotted in each bin is proportional to 
P(n,M p ). We have offset the points randomly within the 
bins for clarity, although n can actually take only integer 
values. We show the results for parent halos in mass ranges 
centered on M = 10 Mi and Mo = 40 Mi, for both the 
simulations and the SAM-trees. Note that the smaller mass 
parent corresponds to the mass of the dark matter halo sur- 
rounding an isolated L* galaxy, and the larger mass parent 
to the dark matter halo surrounding a group. 

These distributions show qualitatively the same be- 
haviour for the SAM-trees and the simulations. At z — 0.2 
most of the progenitors are single progenitors with masses 
close to that of the parent, with the rest of the mass com- 
ing from accretion (halos smaller than Mi). As we go back 
to progressively higher redshifts, the mean progenitor mass 
shifts towards smaller masses, but the mean number of pro- 
genitors stays roughly constant. With some imagination, one 
can see the shape of an elephant head in these plots: the wide 
roundish part on the left is the "head" and the curved plume 
extended out to the right is the "trunk" (especially notica- 
ble in the larger mass parent halo shown (figure^)). At low 
redshift, there is a clump of progenitors with mass close to 
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Figure 3. The two-dimensional distribution of the number of 
progenitors and their masses at redshifts 0.2, 0.5, 1.1, and 2.1, 
for parent halos with average mass Mo = 10 Afj. The points have 
been plotted with random offsets within the bins. The top panel 
shows the simulations, and the bottom panel show the SAM-trees. 



Mo and small numbers of progenitors, which broadens as 
it moves towards larger numbers of smaller progenitors. As 
we move to higher redshift, this clump at the end of the 
"trunk" drains further back into the "head" , and the trunk 
becomes less and less pronounced, until by z = 2 there is 
almost no trace of it. The "trunkiness" (i.e. the presence of 
the thin extended plume) persists longer (until higher red- 
shift) for larger mass parents. Elephants with trunks corre- 
spond to a situation where halos are formed from several 
progenitors, and in this case we would expect to find several 
galaxies within a single dark matter halo. Elephants with- 
out trunks (the more rounded distributions) correspond to 
a situation where there is often a single progenitor that has 
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Figure 4. Same as figure H for Mq = 40 M(. 



mostly grown by accreting mass. Thus in a very general way, 
we can understand from these distributions how the merging 
history of an isolated L» galaxy differs from one living within 
a group or cluster. This characteristic shape is a direct result 
of the combined constraint of the progenitor mass distribu- 
tion (figure ^) and mass conservation. Thus any merger tree 
method that satisfies these constraints should reproduce the 
general properties of the mass-number distribution, regard- 
less of the details of the method. 

When we try to compare the distributions more strin- 
gently, we immediately notice several differences. In the first 
redshift interval, the SAM-trees have progenitors filling in 
all the mass bins up until the largest bin. This indicates that 
there are progenitors with masses very close to Mo, which 
have accreted a very small amount of mass. In the simu- 
lations, the largest progenitor mass bins are almost empty. 
This is a reflection of the same discrepancy that we saw in 



© 0000 RAS, MNRAS 000, 000-000 



Evaluating Semi- Analytic Halo Merging Histories 7 



the conditional mass function (figure — the peak in pro- 
genitor mass near Mo was shifted towards smaller masses 
in the simulations. This means that the largest progenitor 
is smaller and the average accreted mass is larger in the 
simulations. Another, related difference in the P(n, M p ) dis- 
tributions is that in the SAM-trees, the "elephant head" is 
more diffuse and is shifted towards larger numbers of pro- 
genitors. The elephants also lose their trunks faster as one 
goes to higher redshift in the SAM-trees (indicating that the 
evolution is faster). This again mirrors what we saw in the 
conditional and overall mass functions: the simulations have 
more high-mass halos at high redshift than the EPS model 
predicts, indicating less evolution. These discrepancies be- 
come more and more noticeable for larger parent masses. We 
therefore see that the discrepancies in the mean quantities 
predicted directly by the PS and EPS theory compared with 
the simulations lead to discrepancies in the merging history 
of the halos obtained from the EPS-based SAM-trees. 

The complement of the quantity presented above is 
the distribution of progenitor number and accreted mass, 
P(n, /acc). This is the probability that a halo with mass Mo 
had n progenitors and / acc = M acc /Mo in accreted mass. In 
figure pi we indeed see a behaviour that is complementary 
to that we have just discussed with relation to the distribu- 
tion P(n,M p ). The fraction of the parent mass in the form 
of accreted mass is larger in the simulations, and there is 
less scatter in both / acc and N p than in the SAM-trees. The 
accreted mass increases as we move to higher redshift, re- 
flecting the fact that a larger fraction of the mass in the 
universe is in the form of very small halos. The number of 
progenitors increases until a redshift of about z = I , when 
the number decreases as more of the progenitors fall below 
the mass resolution. 



5.3 Projections 

The "trunk" plots discussed in the previous section pro- 
vide a qualitative impression of the evolution of the dis- 
tribution we are interested in, but they are somewhat diffi- 
cult to interpret quantitat ivel y. Note that the conditional 
mass function shown in 



5.1 



P(M P ), 



is a projection of 
J2 n P(n,M p ). We 



the distribution P(n,M p ): P(M P ) 
can also look at the orthogonal projection, i.e. the prob- 
ability of having n progenitors, regardless of their mass, 
P(n) = J** P(n, M p )dM p . We show this quantity for the 
simulations and the SAM-trees in figure ^j. 

The agreement of the distributions in the smaller mass 
bin, Mo — 10 Mi is quite good. For the larger mass bin, 
Mo = 40 Mi , the trees are skewed towards larger numbers 
of progenitors at z — 0.5, agree almost perfectly with the 
simulations at z = 1, and are skewed in the opposite direc- 
tion (towards smaller number of progenitors) at z = 2. The 
sense of these discrepancies and the reversal of the trend 
with redshift should be quite familiar by now and are a di- 
rect result of the discrepancies between the EPS model and 
the simulations that we have already remarked upon. 

Similarly, the projection of the distribution P(n,/ acc ) 
gives the probability that a fraction of the parent mass / aC c 
was in the form of accreted mass, regardless of the number 
of progenitors. This is shown in figure |?]. Once again we can 
compare the EPS prediction for the mean value of /acc, from 
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Figure 5. The two-dimensional distribution of the number of 
progenitors and the fraction of the parent mass in the form of 
accreted mass, at redshifts 0.2, 0.5, 1.1, and 2.1, for parent halos 
with average mass Mo = 40 Mi . The points have been plotted 
with random offsets within the bins. The top panel shows the 
simulations, and the bottom panel shows the SAM-trees. 



equation ^[ and shown on the figures by the full vertical line, 
with the means of the distributions in the simulations and 
the SAM-trees. The mean accreted mass in the simulations is 
larger than the EPS prediction at small redshift and smaller 
at higher redshift. This is once again a consequence of the 
original discrepancy mentioned above. 



5.4 Moments 

In figure |^, we show the first moment and rescaled second 
and third moments of the distribution of the number of pro- 
genitors P„ as a function of the parent halo mass, Mq. The 
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Figure 6. The probability that a halo Mq at z = had n pro- 
genitors, at redshifts 0.2, 0.5, 1.1, and 2.1. The square symbols 
show the results from the simulations, and the histogram shows 
the SAM-trees. The full vertical line shows the mean predicted by 
the EPS model. The short lines on the bottom of the boxes show 
the means of the simulation distributions, and the short lines on 
the top of the boxes show the means for the SAM-trccs. top panel: 
Mq = 10M ; bottom panel: Mo = 40M; 



Figure 7. The probability that a fraction / acc of the parent mass 
was in the form of accreted mass, at redshifts 0.2, 0.5, 1.1, and 
2.1. The curves show the results from the simulations, and the 
histograms show the SAM-trees. The full vertical line shows the 
mean predicted by the EPS model. The short lines on the bot- 
tom of the boxes show the means of the simulation distributions 
(inclusive and total), and the short line on the top of the boxes 
shows the mean for the SAM-trees. top panel: Mo = 10M;; bot- 
tom panel: Mq = 40M; 



rescaled second and third moments are the usual second and 
third moments divided by the first moment, such that they 
would be equal to unity for a Poisson distribution. Bold lines 
are for the SAM-trees and thin lines are for the simulations. 
In the first redshift step (z — 0.2), the moments agree at 
low masses (M w Mi), but diverge for larger M . The di- 
vergence is in the sense that the trees have a larger first, 
second, and third moment than the simulations. In the next 
redshift step, we can see the hint of the beginning of the re- 
versal of the sense of the discrepancy, at the low Mo end. The 
distribution from the SAM-trees is still somewhat broader 



(larger second moment) but now is less skewed (smaller third 
moment) than the simulations. The moments seem to cross 
on all mass scales at about z ~ 1 and by z — 2 the mean 
number of progenitors is smaller for the SAM-trees on all 
mass scales. The second and third moments are still larger 
in the trees than in the simulations. 
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Figure 8. Moments of the distribution of the number of progen- 
itors P n , as a function of parent halo mass Mo, for redshifts 0.2, 
0.5, 1.1, and 2.1. Dashed lines with open symbols show the results 
from the SAM-trees, and solid lines with filled symbols show the 
results from the simulations. The dots indicate the first moment 
(mean), the squares the rescaled second moments (the usual sec- 
ond moment divided by the first moment), and the triangles the 
rescaled third moments (the usual third moment divided by the 
first moment). The rescaled moments would be equal to unity for 
a Poisson distribution (indicated by the horizontal line). 



5.5 Largest Progenitors 

A statistic which should be important in determining the 
properties of the galaxies that form within dark matter ha- 
los is the mass ratio of the largest and second largest pro- 
genitors. Figure ^| shows the ratio of the second largest pro- 
genitor to the largest progenitor M2/M1, as a function of 
the ratio of the mass of the largest progenitor to the parent, 
Mi /Mo. We find very good agreement between the SAM- 
trees and the simulations in both the mean and the variance 
of this quantity. Apparently, the usual discrepancy in the 
progenitor mass distributions cancels out. As we see from 
the histograms at the bottom of the figure, the mass of the 
largest progenitor is larger in the SAM-trees than in the sim- 
ulations at low redshift, and smaller at higher redshift. This 
clearly follows from the original effect shown in figure |^. 



6 DISCUSSION AND CONCLUSIONS 

Many of the previous comparisons of the Press-Schechter 
model with simulations have emphasized the agreement be- 
tween the two, which is surprisingly good in view of the 
crude assumptions underlying the model. Certainly this 
model has served as a useful tool in the study of structure 
formation. However, as comparisons of theory and observa- 
tions become more refined, it is important to be aware of the 
limitations of the model, and to quantify its inaccuracies. 

There are several implications of the errors associated 
with the PS and EPS model, corresponding to various im- 
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Figure 9. Masses of largest progenitors, for redshifts 0.2, 0.5, 1.1, 
and 2.1. The quantity plotted in the upper part of the figure is 
the mass ratio of the second largest to the first largest progeni- 
tor (M2/M1) vs. the ratio of the largest progenitor to the parent 
mass (Mi/Mo). Solid lines show the results from the simulations, 
and filled dots show the results from the SAM-trees. Dashed lines 
show the l-cr variance in the simulations, and the error bars show 
the l-cr variance in the SAM-trees. The histograms show the dis- 
tribution of the mass of the largest progenitor. Shaded histograms 
are for the SAM-trees, and unshaded histograms arc for the sim- 
ulations. 



plementations of the formalism. First, the overall abundance 
of halos of a given mass may be over- or underestimated by 
the PS model compared to A^-body simulations, depending 
on the mass scale and the redshift. The general sense of this 
discrepancy is that the PS model overestimates the halo 
abundance on mass scales less than ~ 10 14 Mq by about a 
factor of 1.5 to 2.0 at z = 0. The model and simulation re- 
sults agree well on all mass scales at an intermediate redshift 
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z ~ 1, but this agreement is momentary and fortuitous: it 
appears to be simply the point where the two functions hap- 
pen to cross as they evolve in redshift. At higher redshifts, 



that our conclusions are very consistent with the findings of 
Kauffmann et al. (1996), who compared galax y formation 



models based on a different merger-tree method (Kauffmann 



the abi ndance of large mass halos M > 10 is underesti- 



White 199S) with models implemented directly within 



mated by the PS model. Because this is the exponentially 
decreasing part of the mass function, this error can be quite 
serious. This behaviour seems to be qualitatively similar re- 
gardless of the cosmology or power spectrum, although the 
evolution with redshift (e.g. the redshift of fortuitous agree- 
ment) will be somewhat different. The precise magnitude of 
the discrepancy between PS and simulations depends on the 
cosmology, power spectrum, and the implementation of the 
Press-Schechter model (i.e. top-hat vs. Gaussian smoothing, 
value of S c ,o, etc.), but the dependence on mass-scale and 
redshift implies that the discrepancy cannot be removed by 
sim ply adjusting the parameter <5 Cj o (see also Gross et al 
199^) 



The second point is that this discrepancy is echoed qual- 
itatively in the comparison of the conditional mass function 
predicted by the EPS model with the simulations. This in 
turn affects the predictions of merger rates, accretion rates, 
formation times, and the distribution of progenitor masses 
in merger trees based on the EPS formalism. We have shown 
that the distribution of the number of progenitors found in 
the merger trees is in fairly good agreement with the results 
of the simulations. On galaxy scales (M ~ 1O 12 M0) these 
quantities are in excellent agreement. On larger mass scales 
(M ~ 10 13 M©, about the size of a group), the number of 
progenitors is over-estimated by the SAM-trees by at most a 
factor of two. The discrepancy increases further as the mass 
of the parent halo increases. This effect artificially steepens 
the faint-end slope of the luminosity function i n EPS-based 
semi-a nalytic models of gala x y formation (e.g. Kauffmann 
White, I fc Guiderdoni (1993; ), Pole et al. (1994| ), fcomcrvillc 



& Primick (1995)). This contributes to an apparent discrep- 



ancy with the observed slope, which seems to require some 
mechanism that strongly suppresses star formation in small- 
mass objects, such as strong snpernovac feedback. The error 



N-body simulations (the same simulations analyzed here). 
They note the same overall factor of two discrepancy in the 
halo mass function, but find that the luminosity function 
for a halo of a given mass is very similar in the two cases. 
This is presumably because, as we have noted, the relative 
properties of the progenitors for a halo of a given mass are 
in good agreement in the SAM-trees and the simulations. 

Our main conclusions from this study may be summa- 
rized as follows: 

• The Press-Schechter and Extended Press-Schechter 
models exhibit discrepancies with N-body simulations at the 
level of up to 50 percent. This can lead to errors in the mean 
numbers and masses of progenitors and the higher order mo- 
ments of the progenitor number-mass distributions as well 
as the overall and conditional halo mass functions. More se- 
rious errors may arise at high redshifts for the rare objects at 
the exponentially declining high-mass end of the mass distri- 
bution (clusters). These effects must be taken into account 
in any detailed comparison of PS or EPS based models with 
observations. 

• Despite these problems, relative properties of the pro- 
genitor distribution (such as the progenitor mass ratios) for 
a halo of a given mass are quite well reproduced in the 
merger trees, especially on galaxy mass scales. Halo merging 
histories constructed using the EPS formalism should thus 
provide a reasonably reliable framework for semi-analytic 
modelling of galaxy formation. 
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